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The majority of spectral imagery classifiers make a decision based on information 
from a particular spectrum, often the mean, which best represents the spectral signature 
of a particular target. It is known, however, that the spectral signature of a target can 
vary significantly due to differences in illumination conditions, target shape, and target 
material composition. Furthermore, many targets of interest are inherently mixed, as is 
the case with camouflaged military vehicles, leading to even greater variability. 

In this thesis, a detailed statistical analysis is performed on HYDICE imagery of 
Davis Monthan Air Force Base. Several hundred pixels are identified as belonging to 
one of eight target classes and the distribution of spectral radiance within each group is 
studied. It has been found that simple normal statistics do not adequately model either 
the total radiance or the single band spectral radiance distributions, both of which can 
have highly skewed histograms even when the spectral radiance is high. Goodness of fit 
tests are performed for maximum likelihood normal, lognormal, gamma, and Weibull 
distributions. It was discovered that lognormal statistics can model the total radiance and 
many single-band distributions reasonably well, possibly indicative of multiplicative 
noise features in remotely sensed spectral imagery. 
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I. INTRODUCTION 



Hyperspectral imagery (HSI) has been developed as a potentially powerful tool 
for classifying scenes and identifying targets in remote sensing. With instruments such as 
the Hyperspectral Digital Imaging Collection Experiment (HYDICE) and the Airborne 
Visible/Infrared Imaging Spectrometer (AVIRIS) that measure spectral radiance in over 
200 bands in the visible and near infrared portions of the spectrum at each pixel in a 
scene, the capability exists to identify targets that would not be detectable in lower- 
spectral resolution images. Many powerful classification strategies have been developed 
that use information about potential targets and backgrounds. However, the majority of 
such strategies, including spectral angle mapping (SAM), spectral feature fitting, matched 
filtering, linear unmixing, and subspace projection techniques are all quasi-deterministic. 
That is, the techniques are derived based upon certain representative (often mean) 
spectral signatures. When variations are present, simple assumptions are made about the 
distribution, and a thresholding operation is performed. While many quasi-deterministic 
strategies have been shown to be relatively successful, little effort has been dedicated to 
developing fully stochastic models. Stein (1999) has demonstrated in RADAR detection 
applications that inclusion of realistic probability models can significantly improve the 
probability of detection for a given false alarm rate, so it is reasonable to believe that 
such models could have applications in spectral imagery. 

To date, the majority of authors have assumed that HSI distributions could be 

made up by considering only signals and noise. A best spectral signature was determined 

based on available data, then all deviations from this were considered to be due to noise, 
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which is generally assumed to be multi- variate normal in nature. It is known that such a 
strategy is only an approximation, as two pixels that both contain the same target or 
material will almost invariably have different spectra, even under the same illumination 
and atmospheric conditions and in the absence of noise. Basically, this means that the 
targets in and of themselves are best described by statistical distributions, not just the 
mean spectrum. Approximating the target distributions by the mean spectrum is a first 
order approximation that matches only the first moment of the actual statistical 
distribution of the target spectra-namely the mean. Healy and Slater (1999) have 
demonstrated an improvement to this strategy. Their algorithm computes a space of 
allowed target distributions based on variations in illumination and atmospheric effects, 
and then projects a pixel spectrum onto that space. A classification decision is made 
based on the angle between the pixel spectrum and the target hyperplane when simple 
normal statistical models are assumed for the noise. Such an algorithm provides more 
robust performance, but necessitates exhaustive calculations of potential signatures from 
each target of interest, and while they reduce the dimensionality of the target hyperplane 
as much as possible, they do not weight regions of this hyperplane where the target 
spectra are most likely to lie based on statistical distributions. Furthermore, the target 
hyperplane is composed by assuming only a mean spectral reflectance, assuming all 
variations are due to changes in environmental parameters. 

This thesis represents an attempt to understand the actual statistical distributions 
that can be expected from targets in spectral imagery. In this investigation, analysis is 
restricted to single bands, but it is seen that deviations from normal distributions are 
common, even when there is enough radiance that one might assume that such a model is 
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sufficient. Chapter II gives a brief overview of the history of hyperspectral imagery and 
gives a basis for the work accomplished in this study. Chapter III provides a detailed 
description of the spectral imagery scene used for this study and the process by which 
histograms are generated. Several histograms display the nature of the regions of interest 
that are examined. Chapter IV covers the statistical analysis of the hyperspectral data, 
including total radiance modeling, single band modeling, and goodness-of-fit testing. 
Chapter V is a discussion of the implications of the work in this thesis and includes a 
discussion into further work to be considered in this area of research. Chapter VI 
concludes the thesis. 
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H. BACKGROUND 



A. HYPERSPECTRAL IMAGING HISTORY 

This thesis deals with a specific field of remote sensing called imaging 
spectrometry, also known as hyperspectral imaging (HSI). HSI sensors are capable of 
providing remotely sensed data in extremely narrow spectral bands, which permit the 
identification of materials on the earth’s surface. Every material has different reflective 
and emissivity characteristics that vary with wavelength and material composition. A 
hyperspectral sensor obtains a spectrum of reflected and self-emitted photons that can be 
used to uniquely identify a certain material. The concept of imaging spectrometry is 
displayed in Figure 2.1 (Vane and Goetz, 1988, p.2). Each picture element or pixel in the 
scene contains enough information for the reconstruction of a reflectance spectrum. This 
allows the narrow-band spectral features to be used for material identification. 




Figure 2.1. Concept for Imaging Spectrometry. From Vane and Goetz, 1988, p.2. 
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Scientists have been using spectral data for decades to identify numerous 
characteristics about the earth such as vegetation growth rates, geologic mapping, forest 
fire damage, shallow water monitoring of oil spills, and mineral identification. Recent 
advancements in HSI have led to sub-pixel object identification and various military 
applications such as the ability to recognize a target from its spectral signature without 
spatially resolving it, ability to discriminate targets from backgrounds, decoys, and each 
other, and the increased potential for robust automated detection using tractable 
algorithms, even in highly segmented and cluttered scenes (Kailey and filing, 1996, 

P.-15). 

The spectral imaging area in remote sensing has undergone tremendous 
development in the last three decades. In 1972, the Landsat I satellite was launched 
with one of the first multispectral imaging devices onboard, the Multispectral Scanner 
(MSS). This sensor used a system of four filters and detectors to split image data into 
multiple spectral bands in which the detector elements mechanically scanned a swath of 
the earth’s surface perpendicular to the instrument flight path in a sweeping-like motion. 
Shortly after the development of the MSS, a second-generation Landsat sensor, the 
Thematic Mapper (TM) was created with seven filter/detector pairs. While both the MSS 
and TM were significant advances in remote sensing technology, they lacked the spectral 
resolution and number of spectral bands needed for more precise surface measurements. 
A. F. Goetz, a pioneer in imaging spectrometry, best described this problem in the 
following paragraph: 

The need for imaging spectrometry grew out of recognition in the 1970’s based 
on laboratory and field spectral measurements, primarily of rocks and soils, that 
multispectral imaging in 4 spectral bands with the Landsat MSS was not adequate 
to discriminate among, much less identify, minerals on the earth’s surface that 
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were important in resource exploration and environmental assessment. However, 
in the 1970’s the technologies of optics, detectors, digital electronics and 
affordable computing, equivalent to the workstation platforms that we are 
presently accustomed to, did not exist. The ideas and the vision were also too 
new to gain immediate acceptance by program managers holding the new- 
technology purse strings (Goetz, 1995, p.3). 

The next major advancement in spectral imagery came in 1981 when the Shuttle 
Multispectral Infrared Radiometer (SMIRR) flew on the second flight of the NASA 
Space Shuttle. The SMIRR proved that identification of surface mineralogy could be 
made from orbit with continuous, narrow-band spectral measurements (Goetz, 1995, p.3). 
SMIRR was a 10-channel radiometer, where three of the channels were only 10 nm wide 
and centered 10 nm apart around 2.2 pm. These channels, and a fourth centered at 2.35 
pm, made it possible to positively identify limestone and the mineral kao Unite (Goetz, 
1995, p.3). 

The stage was then set for the next generation of remote sensing systems which 
contained dispersing elements aUowing acquisition of spectral data over hundreds of 
narrow, contiguous spectral bands. These new systems came to be caUed imaging 
spectrometers, a name that is still used today. The Airborne Imaging Spectrometer (AIS) 
was built in the early 1980’s as a test bed for higher resolution imaging spectrometers and 
yielded a number of advances in detector technology and data analysis (V ane and Goetz, 
1988). The AIS made use of a grating spectrometer that separated a signal into 128 
adjoining bands in the spectral region from 1.2 to 2.4 pm, instead of a system of filters 
and detectors. In order to be able to cover this spectral region with 9.3 nm sampling, the 
grating was stepped through 4 positions in the 40 ms needed to traverse one pixel on the 
ground. AIS 2, a later version of the sensor, incorporated a larger element array and 
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imaged over the 0. 8-2.4 pm region. The AIS system, however, had its problems, among 
them, lack of full spectral coverage in the visible and shortwave infrared and also a very 
narrow field-of-view (FOV). The success of AIS led NASA to create an imaging 
spectrometer program from which the next generation aircraft system, the Airborne 
Visible/Infrared Imaging Spectrometer (AVIRIS) was developed. 

AVTRIS came into operation in 1987 and is still one of the primary providers of 
imaging spectrometer data to the research community today (Vane, 1993). It covers the 
0.4-2.4 pm region with 224 spectral bands and has a resolution on the order of 10 nm, 
allowing it to detect most absorption features. The AVIRIS optical system consists of a 
scanner, four spectrometers, and a calibration source, all coupled by fiber optics. In fact, 
AVIRIS was the first spectral imaging system that was radiometrically calibrated, making 
it possible to apply atmospheric models to obtain surface reflectance (Goetz, 1995, p.6). 
AVIRIS has acquired data for a wide variety of studies including coastal zone water flow, 
snow grain size measurement, determination of water status in vegetation, and coniferous 
forest damage. Also, the increased spectral range in the visible region provided by 
AVIRIS allows biologists to study important reactions in shallow water biology. 

The Hyperspectral Digital Imagery Collection Experiment (HYDICE) is one of 
the newest, high-resolution airborne imaging spectrometers in use today (Rickard, 1993). 
It covers the 0.4-2.5 pm spectral region with a 10.2 average spectral resolution. At an 
altitude of 6 km, its spatial resolution is 3m over a 636m swath when flown at low 
altitude (5,000 feet). Also, HYDICE uses a 210-element indium antimonide area detector 
array which is designed to cover the full spectral range using a single optical path design. 
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This is an impressive simplification of the AVIRIS design, which uses multiple 
spectrometer/detector pairs and an optical subsystem coupling. 

Another system of note is the Midcourse Space Experiment (MSX) satellite, 
which carries a hyperspectral imaging suite built by John Hopkins University Applied 
Physics Laboratory called the Ultraviolet and Visible Imagers and Spectrographic 
Imagers (UVISI). This sensor suite covers the wavelength region . 1 1-.9 pm and provides 
a spectral resolution on the order of .5-4.3 m The next generation spacebome spectral 
sensor is the Moderate Resolution Imaging Spectrometer (MODIS) which is on-board the 
Earth Observing Satellite (EOS-1). This sensor encompasses the earth every 1 to 2 days 
with a spectral resolution of 10-15 nm in the visible (Hemandez-Baquero, 1997, p.l 1). 

B. HYPERSPECTRAL DATA ANALYSIS TECHNIQUES 

The detection of a certain target or material is achieved by using the large amount 
of information available from its observed spectrum. Thus, the problem presented is to 
localize spatially the parts of the spectrum that are characteristic of the target material. 
There are many different techniques that can be applied to this problem; however only 
the most prominent will be briefly discussed here. 

Stefanou (1997) surmised that there are five major strategies for the analysis of 
hyperspectral data. Figure 2.2, shows a listing of these five strategies and the key points 
of each. There are different methods that can be applied within each strategy as well, 
however, they will not be discussed in this thesis. 



9 



j Spectral Imagery Analysis Strategies 



Transformation and Projection 

- Linear transformation performed on each pixel 

- Goal is to find trie ‘right* set of basis functions and project image into “target" subspace 

- May act as a preprocessing step prior to classification 



Classification 

- Pixels assigned to spectral classes based on similar statistics 

- Stochastic outlook on data 

- Assumes spectrally pure pixels 



Linear Prediction 

- Exploits the spatial and spectral redundancy inherent in spectral images 

- Each pixel is viewed as a linear combination of its neighbors 

- Originally appBed to data compression 



Optimal Band Selection 

- Goal is to pick the best spectral bands that win discriminate a target 

- Band selection process may be deterministic or stochastic 

- Scene dependent 



Multiresolution Analysts 

- Goal is to use different spatial resolution images to form a high resolution composite image 

- Exploits spatial correlation between neighoring pixels 

- Concepts related to and involving wavelets 



Figure 2.2. Listing of Spectral Imagery Analysis Strategies. From Stefanou, 

1997, p.5. 



The first strategy entails the use of linear transformations and projections where 
data is visualized as belonging to either a signal subspace or a noise subspace (Stefanou, 
1997, p.8). A subspace is a linear algebra term that describes vectors with similar 
characteristics or properties. The inherit problem of the mixed pixel is assumed in most 
methods of this strategy, with a mixed pixel being an assumed linear mixture of 
spectrally pure materials called endmembers. Each pixel is treated independently, since 
no assumptions are made concerning the spatial arrangement of the pixel vectors. This 
has led to a search for more effective techniques within the strategy of linear 
transformations and projections where no knowledge of the background is assumed. 

The second strategy in analyzing hyperspectral imagery described by Stefanou 
(1997, p.9) is a pixel classification approach. Observed pixel spectra are assigned to 
classes based upon similar statistical criteria. This strategy assumes a stochastic outlook 
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of the data. The mixed pixel problem is not assumed, and the target spectrum is 
discriminated based on its membership in a class separate from background pixels. 

The third strategy is formulated upon the theories of linear prediction. It makes 
the assumption that each pixel vector is a linear combination of its neighbors, and seeks 
to take advantage of this relationship. The postulate is to create a residual image in 
which there is less redundancy spectrally and spatially. The data from this strategy is 
viewed statistically and is modeled as such (Stefanou, 1997, p.9). 

The fourth strategy is the process of optimal band selection. The goal is to select 
the best original spectral bands of the hyperspectral image that can be used to 
discriminate the target. The band selection process is guided by a either a statistical or 
deterministic view of the target and background spectra. No explicit assumption about 
the linear mixing model is made (Stefanou, 1997, p.9).‘ 

The fifth strategy involves the use of multi-resolution techniques. Concepts such 
as wavelets are used to pick out different levels of detail from the hyperspectral image. 
The spatial correlation between neighbors is exploited, though a statistical view is not 
necessarily required (Stefanou, 1997, p.9). 

The goal of this research is to have a significant impact on the first two spectral 
imagery analysis strategies. The linear projection techniques assume a mean or average 
spectrum of the target pixels with the addition of noise. However, more realistic 
statistical distributions describe target pixels better than a mean spectral signature. In this 
thesis, attempts were made to employ second order statistics (variance) along with non- 
normal models to improve performance of these spectral imagery analysis techniques, 
although no attempts were made to improve classifiers. 



11 



In this thesis, a priori knowledge was used about HSI data obtained from Davis 
Monthan Air Force Base to be able to classify individual pixels as being a part of a 
certain target or background class. Since the spatial arrangement of the scene was 
known, pixels were selected by close, visual inspection of objects, and by using 
individual pixel spectra to determine if a certain pixel was indeed from a given target 
class. After creating regions of interest based on this technique, a detailed, higher order 
statistical analysis was then performed on each of the regions of interest, as is discussed 
in Chapter IV. This thesis utilizes The Environment for Visualizing Images (ENVI) 
version 3.2, Interactive Data Language (IDL) version 5.2.1, and MATLAB version 
5.3.1.29215a, for the analysis of the HSI data from Davis Monthan Air Force Base. 
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III. HISTOGRAMS 



A. HYPERSPECTRAL SCENE: DAVIS MONTHAN AIR FORCE BASE 
The hyperspectral data used in this thesis was gathered from a Hyperspectral 
Digital Imaging Collection Experiment (HYDICE) imagery scene of Davis Monthan Air 
Force Base (DMAFB) in Arizona. Two sets of HYDICE passes (GSD of 1.5 meters) 
were taken of DMAFB in June and October 1995. This scene was chosen for three 
primary reasons. The first reason was for its inherently consistent background 
characteristics. The predominate background material is sand, which, was easy to 
separate from targets due to its consistent spectral characteristics. The second reason 
DMAFB was chosen was for its variability of reference target data. There are hundreds 
of aircraft, clustered by type, positioned in a designated area of the base - the aircraft 
boneyard. The type of aircraft include, but are not limited to, the A-6 Intruder, P-3 
Orion, C-130 Hercules, B-52 Stratofortress, F-4 Phantom, and AV-8B Harrier. The 
aircraft are also painted in varying paint schemes such as gray, white, and camouflage. 
The combination of numerous aircraft and varying paint schemes provides a broad 
selection of spectral targets for hyperspectral analysis. The third reason Davis Monthan 
was used was for its ability to provide hundreds of classified pixels for each target class. 
Having an extremely large group of classified pixels helps to ensure that the statistics 
compiled are accurate and reliable. Figure 3.1 shows an overhead view of the portion of 
the October and June over-flights of Davis Monthan AFB analyzed here. Various types 
of aircraft are identified in the figure, as well as a fence line that separates the airfield and 
nearby road. 
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Figure 3.1. HYDICE scenes of Davis Monthan AFB (October scene is to the left). 
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Figure 3.2 shows a portion of area 4, which contains P-3 aircraft, and a portion of 
area 5, which contains C-130 aircraft. The two areas are separated by a road that runs 
diagonally across the figure (highlighted in white). The aircraft in both areas are painted 
in either a white, a gray or a black-brown-green camouflage pattern. Given the feet that 
areas 4 and 5 have a variety of distinct targets and were covered by both passes of the 
HYDICE sensor, the majority of the data used in this research was taken specifically 
from these two areas. Using the data gathered from these two areas enables further study 



in the difference between passes. 



Area 4 




~\ 




Area 5 



J 



Figure 3.2. Areas 4 and 5 of Davis Monthan scene (October over-flight). 



Davis Monthan Air Force Base, given its advantages as a hyperspectral scene, 
made it easy for regions of interest (ROIs) to be identified. ROIs are groups of individual 



target pixels that are grouped together to uniquely classify them as being a certain object. 
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In HSI research, objects are usually classified based on their spectral signature (radiance 
or reflectance) as a function of wavelength. For the purposes of this study, pixels are 
grouped into ROIs based on a priori knowledge of the types and locations of the aircraft. 
From these ROIs, histograms were developed that are used for statistical modeling. In 
this thesis, each ROI was developed through careful analysis of individual pixel profiles 
in order to establish a common relationship among the pixels, such as being a part of the 
same P-3 Orion aircraft. This was accomplished in ENVI by zooming into an individual 
aircraft so that each pixel is clearly discernible. Then, based on pixel profiles and view of 
the aircraft body, individual pixels were selected and identified as being “pure” and 
associated with that type of aircraft. Figure 3.3 shows this pixel identification process for 
a P-3 aircraft. Pixel 1 is clearly on the aircraft, pixel 2 is clearly on the ground, and 





Figure 3.3. Pixel identification process of a P-3. 



16 



pixel 3 is a mixed pixel since it falls on the edge of the aircraft and is likely to contain 
both background and target characteristics. Given that pixels 1 and 3 are pure pixels, the 
overall shape of the spectrum corresponding to each can be identified and the spectrum of 
pixel 2, which is intermediate between the two, can be discarded. This enables the 
selection of only pixels that are reliably on the aircraft in order to classify those pixels as 
being a part of the P-3 ROI. 

For both the C-130 and P-3 ROIs, pixels were selected from dozens of different 
aircraft that all had the same paint scheme. This prevented ambiguity between pixels 
since different colors of paint have different spectral signatures and are essentially 
different spectral targets. Figure 3.4 displays the P-3 aircraft from which pixels were 
chosen to create the P-3 ROI and the C-130 aircraft that were used to create the C-130 
ROI. The selected aircraft are bordered by white lines. Over 1700 “pure” pixels were 
included in the C-130 ROI and over 1400 in the P-3 ROI to develop a large enough data 
set to perform reliable statistics. 



This 

aircraft not 
included 




Figure 3.4. P-3 and C-130 aircraft selected for inclusion in Regions of Interests. OnlyC- 
130s painted camouflaged were used, here the jagged lines border the selected aircraft. 
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One of the potential applications of HSI is to detect targets that are sub-pixel in 
size. Thus, a third region on interest, a fence line, was chosen to give statistical 
information about sub-pixel targets. When a target is smaller than a pixel, the target 
areas are by definition not pure. The measured reflectance is therefore a combination of 
the various elements in the spectral scene. Figure 3.5 shows a region of the image that 
contains the fence line (highlighted in white) that separates the boneyard from the road. 



Figure 3.5. Fence line ROI. 




Initially, the fence line presented some difficulty in classifying individual pixels; 
however by comparing one pixel on either side of a fence pixel, a clear indication of the 
fence line spectral profile became evident. Figure 3.6 shows how a fence line pixel was 
selected. The top portion is a close-up view of the fence line and the bottom portion 
shows the pixel spectra. 
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Pixel 2 



Pixel 1 
Pixel 3 



Fence Line pixels 



Pixel 2 



Pixel 3 
Pixel 1 

500 1000 1.500,, 2000 250 

Wavelength 

Figure 3.6. Fence line pixel classification. 

Pixel 1 is on the top of the fence line with pixels 2 and 3 being on either side of 
the fence. These three spectral signatures are very similar, but as can be seen from the 
spectral plots, pixels 2 and 3 show a higher overall radiance value than the profile of 
pixel 1 . This behavior was consistent along the entire fence line and was most likely due 
to shadowing. Thus, pixels 2 and 3 were eliminated and pixel 1 was selected as being a 
part of the fence line ROI. 
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In order to further examine the characteristics of the entire DMAFB scene and to 




compare potential “targets” with “background”, two separate background regions of 
interest were created. These ROIs are primarily composed of sand and dirt and provide a 
good basis for comparison with the other selected ROIs since sand and dirt have a higher 
overall reflectance than the other ROIs. Figure 3.7 shows the areas of the Davis Monthan 
scene that contain the background ROIs. 



Figure 3.7. Regions of DMAFB for Background ROIs. The picture on the left shows the 
region from which the first background ROI is taken. The picture on the right shows the 
region from which the second background ROI is taken. The specific areas are bordered 

with white lines. 



These five regions of interest provided the data from which various histograms 
where generated and analyzed- Table 3.1 shows the number of pixels included in each 



ROI. The following section describes the process that created these histograms. 



ROI 


# of Pixels 


! C-130 


1740 


P-3 


1689 


Fence Line 


319 


Background 1 


5672 


Background 2 


4725 



Table 3.1. Number of Pixels included in each ROI. 
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B. HISTOGRAM GENERATION 



Three different variables were used for the statistical analysis in this thesis, total 



The total radiance histograms were used to gain an overall understanding of the data 
being examined since each histogram exhibits unique characteristics due to illumination 
variations. A program was written in MATLAB that extracts the total radiance data for 
each pixel in the regions of interest and creates a histogram of the data. A copy of all the 
computer programs used in this thesis is included in Appendix A. 

I. Total Radiance Histograms of the Regions of Interest 

The histograms presented here represent the total radiance across all bands for 
each respective ROI. Each histogram exhibits unique characteristics due to illumination 
variations resulting from spatially changing atmospheric effects, shading, shape, and 
other physical properties of the material being investigated (Kruse, 1988). The total 
radiance at each pixel is simply computed as 



where L(x, y, A) represents the radiance of each pixel at a certain wavelength (A). The 
histogram for the total radiance within the C-130 ROI for the October over-flight is 
shown in Figure 3.8. 



pixel radiance, single band normalized and single band unnormalized spectral radiance. 




(3.1) 



A 



21 



Total Radiance Histogram of C-130 ROI 




Figure 3.8. Total Radiance Histogram of C-130 ROI. 



A purely cursory examination of this histogram reveals that it is not normal in 
nature but rather skewed to the left with a long right tail. This means that a normal 
distribution probably will not fit this data well. In the following chapter, different 
statistical distributions will be fitted to this histogram in an attempt to model it 
effectively. 

Figure 3.9 displays the P-3 ROI histogram. This has a bi-modal distribution, 
shown as two humps in the histogram, which makes it difficult to model Further 
investigation revealed that the cause of the bi-modal distribution was due to fluctuations 
in radiance along the fuselage of the P-3 aircraft. When selecting pixels for inclusion in 
the P-3 region of interest, pixels were taken from across the entire surface of the aircraft, 
including the fuselage and the wings. 
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Counts S' Counts 



Total Radiance Histogram of P-3 ROI 




Figure 3.9. Total Radiance Histogram of P-3 ROI. 



Figure 3.10 shows histograms of the total radiance across the P-3 fuselage and the 



The wings show a single mode distribution. 



Total Radiance Histograms of P-3 Fuselage and P-3 Wings 




Figure 3.10. Total Radiance Histograms of P-3 Fuselage and P-3 Wings. 
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The fuselage, however, seems to be the contributing factor to the bi-modal 
nature of the entire P-3 ROI. This feature of the P-3 ROI makes it difficult to adequately 
model with the statistical distributions considered here. Figure 3.11 displays the 
histogram for the fence line ROI. This region of interest was chosen in an effort to show 
the difficulties in analyzing sub-pixel targets. The histogram is slightly skewed to the 
right with a short, left tail. 



Total Radiance Histogram for Fence Line ROI 




Figure 3.11. Total Radiance Histogram of Fence Line ROI. 



Figure 3.12 shows the histograms of the background ROIs. They both are slightly 
right skewed with short left tails. These histograms will be compared to the same 
statistical distributions as the previous histograms to check for goodness of fit. 
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Total Radiance Histograms of Background ROIs 




Figure 3.12. Total Radiance Histograms of Background ROIs. 



2. Single Band Histograms of the Regions of Interest 

Using EDL, a program was written that generates single band, normalized and 
unnormalized radiance histograms from selected ROIs. The program first accepts a ROI 
as input and normalizes the data in that ROI. This computation is a standard operation 
performed to eliminate the effects of albedo in processing spectral data (Kruse, 1988). A 
region of interest is normalized by dividing the spectral radiance value of each pixel in 
the ROI by the total radiance of all bands in that pixel. The normalization function can 
be written as: 



L(x,y,X) 



L(x,y,X) 

Y^Ux,y, A) 



A 



(3.2) 



Equation 3.2 scales the spectrum of each pixel to approximately the same overall relative 
brightness. The normalization process is critical in reducing the number of ambiguities in 
spectral data. Much of the variability in a spectral scene is due to apparent illumination 
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differences that arise from shading and curvature of target surfaces. For example. 
Figure 3. 13 shows two pixel spectral profiles from the P-3 ROI that are not normalized. 
Figure 3. 14 shows these same spectra after the normalization process. 




Figure 3.13. Individual P-3 spectra before normalization 




Figure 3.14. Same spectra after the normalization process. 

The two bands have been normalized to the same relative brightness, which makes it 
more apparent that the two spectra are similar in nature. 

After the normalization is complete, a subset of the 210 HYDICE bands was 
chosen by investigating the mean spectral signature of the various ROIs. Each ROI has a 
mean specific signature due to global factors of illumination, absorption, and reflectance. 
Likewise, each pixel spectrum has numerous fluctuations about the mean due to 
individual pixel characteristics such as orientation, shading, and imperfections on the 
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target surface. As an example. Figure 3.15 shows a profile of a pixel taken from the 
fence line ROI. There are peaks and valleys in the spectrum that correspond to different 
characteristics of the scene, illumination, and atmosphere. In order to understand the 
statistics of different portions of the spectral signature, bands 50, 68, 75, 80, 88, 94, 1 14, 
and 117 were chosen. The points corresponding to these bands are highlighted in 
Figure 3.15. 




Figure 3.15. Pixel spectrum from fence line ROI with highlighted bands. 



Once the bands were selected, the histograms could then be calculated. For each 
band, the range of normalized radiance was separated into bins for proper viewing of the 
histogram. The number of bins chosen depends on the ability to give reasonably smooth 
histograms for analysis. A copy of the program written to compute these histograms is 
given in Appendix A. The following section will provide examples and discussion of the 
various histograms that were generated for each region of interest. 
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a. Unnormalized Single Band Histograms of the ROIs (October) 

The next step in the analysis of the Davis Monthan data is to compute and 
display the single band histograms for each of the respective ROIs from the October data. 
Each of the histograms in this subsection is computed using unnormalized spectral 
radiance. As was described earlier, each ROI is comprised of data that is divided into 
210 spectral bands where each band exhibits its own unique spectral signature. By 
researching the characteristics of several of the spectral bands within a ROI, one can 
determine which statistical distribution fits best for specific bands, and hopefully gain 
insight into its overall distribution. The first step is to look at the unnormalized single 
band histograms and observe the significant differences between the normalized and 
unnormalized data. The C-130 and P-3 wing ROIs will be presented here as examples of 
what was seen across all five regions of interest. 

Figure 3.16 shows the C-130 unnormalized single band histograms. AH of 
the histograms appear left skewed with right tails. 



Unnormalized Single Band Histograms of C-130 ROI 




Figure 3.16. Unnormalized Single Band Histograms of C-130 ROI 
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The next set of unnormalized single band histograms are taken from the P- 
3 wing ROI. Figure 3.17 displays the unnormalized single band histograms taken from 
the P-3 wing ROI. The histograms of Bands 59 and 84 were not shown in this figure for 
purposes of clarity since they overlapped the other histograms. Again, the histograms are 
left skewed with right tails. 



Unnormalized Single Band Histograms of Wings, P3 ROI 




Figure 3.17. Unnormalized Single Band Histograms of P-3 Wing ROI. 



The general trend that was seen in Figures 3.16 and 3.17 of left skewed 
histograms with right tails, was seen in the other regions of interest as well. The 
following subsection will display normalized radiance histograms from the October over- 
flight data. 
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b. Normalized Single Band Histograms of the ROIs (October) 

The next step in the analysis of the Davis Monthan data is to compute and 
display the normalized radiance single band histograms for each of the respective ROIs 
from the October data. Each of the histograms is computed using normalized spectral 
radiance, using Equation 3.2. For each of the regions of interest, several histograms of 
single bands are displayed from the 210 HYDICE bands. In a separate figure, a few of 
these histograms are displayed in an effort to capture the general variability that is seen 
across all 210 HYDICE bands. Figure 3.18 shows several histograms of bands from the 
C-130ROI. 




0 0.002 0.004 0.006 0.008 0.01 0.012 0.014 0.016 0.018 0.02 

Normalized Radiance 

Figure 3.18. Single band histograms of C-l 30 ROI. 
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The 22 histograms of individual bands shown in Figure 3.18 show a 
representative sample of the 210 bands in this ROI. Some of the histograms look 
symmetric while others are skewed either to the left or right. In order to see the general 
variability that is seen across all bands, six of the histograms were plotted in Figure 3.19. 



Selected Single Band Histograms of C-130 ROI 




Normalized Radiance 



Figure 3.19. Selected single band histograms of C-130 ROI. 



These six histograms capture the general variability that is seen across all 210 bands. 
Bands 1 and 34 are from the visible portion of the spectrum where the measured radiance 
is high, and while these are the most symmetric distributions, they still appear somewhat 
skewed. Band 53 is at the edge of the visible, and has its long tail to the left. Bands 81, 
92, and 120 are moving through the Near Infrared (NIR) where total response is 
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decreasing, and the distributions are becoming more skewed to lower values with tong 
right tails. The nature of the normalized single band histograms seen in Figure 3.19 is 
significantly different from the unnormalized histograms seen in Figure 3.16 where all of 
the histograms were left skewed with right tails. Figure 3.20 displays various normalized 
radiance single band histograms from the P-3 ROI. 




0 0.002 0.004 0.006 0.008 0.01 0.012 0.014 0.016 



Nonnalized Radiance 

Figure 3.20. Various normalized single band histograms of P-3 ROI. 

This figure shows the variability seen across all bands of the P-3 ROI. In 
some of the histograms, there seems to be evidence of the bi-modal distribution that was 
seen in the total radiance histograms. By viewing a selection of these single band 
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histograms, the typical nature of the distributions in this ROI should become evident. 
Figure 3.21 displays six of the histograms from the P-3 ROI. 



Selected Single Band Histograms of P-3 ROI 




Figure 3.21. Selected Single Band Histograms of P-3 ROI. 



The six histograms shown seem to show the bi-modal characteristic of the 
P-3 ROI. A comparison between these histograms and distributions from just the wing 
data of the P-3 ROI are necessary to get more favorable histograms for statistical 
distribution fitting. The following figure shows selected single band histograms from the 
wing data of the P-3 ROI. The same bands that were used for analysis from the entire P- 
3 ROI were used in an effort to show the direct comparison between the two sets of data. 



33 



Figure 3.22 displays six histograms from this data. From the figure, it seems that the 
histograms, are more unimodal in appearance than the histograms from the entire ROI. 
This could be an indication, as was discussed previously, that the fuselage data is 
skewing the results of the P-3 ROI and causing a bi-modal appearance of the histograms. 
Also, the histograms below are much different from the unnormalized P-3 wing 
histograms which were all left skewed with right tails. Figure 3.23 shows six selected 
histograms from the fence line ROI. 



Selected Single Band Histograms for Wings of R-3 ROI 




Figure 3.22. Selected single band histograms for wings of P-3 ROI. 



Selected Single Band Histograms of Fence Line ROI 




Figure 3.23. Selected Single Band Histograms from Fence Line ROI. 
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In Figure 3.23, bands 50, 68, and 75 all appear right skewed with left tails 
and bands 88, 94, and 1 17 all appear left skewed with right tails. Bands 80 and 114 from 
Figure 3.15 are not displayed here since their histograms overlapped the histograms of 
Bands 68 and 117 respectively. However, the general nature of the fence line spectrum is 
still captured using the above six bands. 

The next figure displays selected single band histograms from the first 
background region of interest. Since the background is mostly comprised of sand, it 
exhibits a very even, spectral distribution with few irregularities. Figure 3.24 displays the 
six histograms that most closely correspond to the nature of all 210 bands across this 
ROI. 



Selected Single Band Histograms of First Background ROI 




Figure 3.24. Selected Single Band Histograms of First Background ROI. 
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The histograms in Figure 3.24 are fairly symmetric, with the exception of 
Band 52, which is skewed to the right. The following chapter will discuss the ability of 
the normal distribution to fit these single band histograms. As a comparison. Figure 3.25 
displays single band histograms from the second background ROI. This ROI was 
comprised of sand and dirt dominated pixels that were located a few hundreds of meters 
away from the first background ROI. The similarity in shape of the single band 
histograms of the second background ROI with the single band histograms of the first 
background ROI can be seen. 



Single Band Histograms of Second Background ROI 




Figure 3.25. Selected Single Band Histograms of Second Background ROI 



C. JUNE 1995 HYDICE OVERFLIGHT DATA 

The Davis Monthan data that has been presented thus far was taken from a 
HYDICE image taken in October 1995. This next section will describe data that was 
taken from a HYDICE image from the June 1995 over-flight. The three regions of 
interest created from the June data, a C-130, P-3 Wings, and P-3 Fuselage ROI, were 
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made using essentially the same aircraft as the October data. As was done earlier in this 
chapter, the total radiance histograms of each ROI will be examined first, followed by the 
single band normalized and unnormalized histograms. 

Figure 3.26 displays the total radiance histogram of the C-130 ROI. It appears 
slightly left skewed with a right tail, which is very similar to the October C-130 total 
radiance histogram. The fact that the histograms look the same is important and will be 
discussed in greater detail in Chapter V. 



Total Radiance Histogram from June of C-130 ROI 




Figure 3.26. Total Radiance Histogram of C-130 ROI from June Over-flight 



Figure 3.27 displays the total radiance histograms of the P-3 Wings and P-3 
Fuselage ROIs. Again, these two histograms are similar in shape to the total radiance 
histograms from the October over-flight data. The bi-modal characteristic of the P-3 data 
is clearly evident in the histogram of the fuselage. 
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Total Radiance Histograms of P-3 Wing and Fuselage ROIs from June Overflight 




Figure 3.27. Total Radiance Histograms of P-3 Wings and Fuselage from June Overflight 



The next histograms that will be displayed are the unnormalized single band 
histograms from the June over-flight data. Figure 3.28 shows the unnormalized radiance 
single band histograms of the C-130 ROI. The appearance of these histograms is nearly 
identical to the unnormalized single band histograms from the October data that were 
shown in Figure 3.16. 
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Selected Single Band Histograms of Unnormalized June 0-130 data 




Figure 3.28. Unnormalized Single Band Histograms of C-130 from June Over-flight 



Figure 3.29 shows the unnormalized single band histograms of the P-3 Wings 
ROI. These histograms have a like appearance to the unnormalized histograms displayed 
in Figure 3.17 and based on the results from the statistical analysis in Chapter IV, exhibit 
similar distribution fits. 



Unnormalized Single Band Hist, of June from Wings, P-3 




Figure 3.29. Unnormalized Single Band Histograms of P-3 Wings from June Over-flight 
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The next figures shown are the normalized single band histograms from the ROIs 
of the June over-flight data. The normalized single band histograms for the C-130 ROI 
are shown in Figure 3.30. They are very similar in appearance to the normalized 
histograms of the October C-130 ROI that were shown in Figure 3.19, which is not 
surprising given the fact that the total radiance histograms were similar as well. 



Normalized Single Band Histograms from June of C-130 ROI 




Normalized Radiance 



Figure 3.30. Normalized Single Band Histograms of C-130 ROI from June Over-flight 
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Figure 3.3 1 displays the normalized single band histograms from the P-3 Wings 
ROI. The June data histograms are slightly different in appearance from the October data 
histograms, but each band histogram from the June data is still skewed the same way as 



each of the band histograms in Figure 3.22. 



Single Band Histograms from June of Wings, P3 




Figure 3.31. Normalized Single Band Histograms of P-3 Wings from June Over-flight 



This chapter described the Davis Monthan Air Force Base hyperspectral scene 
and the different aspects of the spectral data that was extracted from it. Total pixel 
radiance, unnormalized single band, and normalized single band histograms were 
displayed for various regions of interest from both the October and June over-flights in an 
effort to show the wide range of variability that exists in hyperspectral data. 
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IV. 



STATISTICAL MODELING 



A. BACKGROUND OF STATISTICS USED 

In this thesis, several statistical distributions were fitted to the Davis Monthan 
data The maximum likelihood estimators (MLEs) of the gamma, normal, and lognormal 
distributions, as well as the Weibull in some instances, were compared to the total 
radiance and single band histograms of the selected regions of interest. The % 2 goodness- 
of-fit test, a test that computes how well the MLEs fit the histograms, was used to provide 
a numerical basis for analysis of the MLE fits. This section will give a brief overview 
of the statistical distributions, maximum likelihood estimators (MLEs), and the % 2 
goodness-of-fit test used for this research. 

The method of maximum likelihood estimation is used to find the parameters of a 
given distribution that have the best chance of producing the observed data The 
likelihood function, gives the probability of realizing the observed sample as a function 
of the possible parameter values, the mean (p) and variance (cr 2 ) for the normal and 
lognormal, and a and f3 for the gamma and Weibull. Maximizing the likelihood gives the 
parameter values that are most likely to have produced the observed sample (Devore, 
1995, p.266). The method of MLE was used in this thesis to fit parameters to several 
different histograms of the total radiance as well as the single band normalized radiance 
within a respective region of interest. 

Based on the shapes of the histograms observed in Chapter HI, the normal, 
Weibull, gamma, and lognormal probability distribution functions were chosen in an 
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attempt to approximate the functional shape of the actual histograms for the DM dat a, 
The likelihood function for a normal probability distribution is 



fix-.M.c t 2 )= , 1 (4.1) 

V2 7TC7 

where p is the mean and a 2 is the variance. To find the maximizin g values of p and a 2 , 
we must take the partial derivatives of In (/) with respect to p and cr 2 , equate them to zero, 
and solve the resulting two equations. Normal pdfs are always symmetric and can 
produce both positive and negative observations. The likelihood function for the 
lognormal distribution is 



.e-MWv 2 )' (4.2) 

V 2ncr 2 x 

where a nonnegative random variable x is said to have a lognormal distribution if the 
random variable y = ln(x) has a normal distribution. Note that in this case, p and a 2 are 
not the mean and variance of x, but of In (x). Lognormal pdfs are always skewed left 
with long right tails. The Weibull and gamma distributions were also used in comparison 
to the histogram of the total radiance and the Statistics Toolbox (ver. 5.3.1) ofMATLAB 
was used to implement these distributions. MATLAB’s MLE for the gamma and 
Weibull use a numerical minimization technique based on a Neldor-Meade simplex 
search algorithm (MATLAB Statistics Toolbox Manual, 1999). The gamma probability 
distribution function (pdf) is 



x a -'e ~ xlfi , 

(a) 



(4.3) 



where a and p are the parameters of the distribution. Gamma pdfs are skewed left with 
long right tails and are non-negative. The Weibull probability distribution function is 
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- x a ~ l e~ {xl P)tt 



(4.4) 



fi 



where a and p are the parameters of the distribution. 

The Weibull distribution was explored in this research since it proved useful in 
RADAR applications (Stein, 1997). In contrast to the other distributions, the Weibull 



a good fit to most of the data in this study, and was difficult to implement. In some of the 
following figures, attempts were made to model this distribution, but overall the results 
were not promising. In some cases, implementation of the MLE Weibull failed to 
converge to accurate results, and was therefore omitted. 

The x 2 (chi-squared) goodness-of-fit test compares the number of observed events 
in a particular range of values to the number of expected observations as predicted by a 
particular probability distribution function. The first step in the chi-squared test is to 
divide the predictive pdf into approximately equal probability data cells. The next step is 
to compute the number of predicted and actual observations in each range. The last step 
is to compute the actual value of the chi-squared statistic and it is given by: 



considered, when the null hypothesis (Ho) is true (Devore, 1995). When parameters are 
estimated, the number of degrees of freedom (v) must be reduced to: 



distribution can be skewed in either direction. However, this distribution did not provide 




ail cells 



(actual observations - expected observations ) 2 
expected observations 



(4.5) 



When the number of observations is large, the statistic x 2 has approximately a chi- 



squared distribution with k- 1 degrees of freedom, where k is the number of data ranges 
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v = k - p - 1 , 



(4.6) 



where p is the number of parameters estimated (2 for all distributions considered in this 
thesis). The number that is produced from equation 4.5 is called the chi-squared value, 
which denotes the number on the measurement axis such that a (significance level) of the 
area under the distribution curve with v degrees of freedom, lies to the right of the chi- 
squared value. That is, the chi-squared value represents the values for which 100*a% of 
random sequences generated by the distribution would have a worse fit than the actual 
data. Basically, the lower the value of % 2 in Equation 4.5, the better a certain statistical 
distribution fits the data. 

B. STATISTICAL MODELING 

1. Total Radiance Analysis for October and Jane Overflights 

A first step in the analysis of the data is to examine the distribution of the total 
radiance in the images. It is expected that a significant portion of the variability in 
spectral imagery is due to ill umin ation variations resulting from spatially changing 
atmospheric effects, shading, shape, and other physical properties of the image. Removal 
of these effects is an important first step in HSI data processing (Kruse, 1988). The 
histogram for the total radiance within the C-130 ROI and the corresponding fits for the 
pdfs considered here are presented in Figure 4.1. 
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Total Radiance Histogram of 0-130 ROI with Chstrltxjtion Fitting 




Figure 4.1. Total Radiance Histogram of C-130 ROI with Distribution Fitting. 



As was seen in Chapter m, the distribution of the total pixel radiance in the C-130 ROI is 
highly asymmetric, a normal distribution does not approximate the function well, and the 
MLE for the Weibull did not converge. The gamma and lognormal distributions capture 
more of the functional shape of the histogram, but even the lognormal is not quite skewed 
enough to fit the histogram. 

A x 2 goodness-of-fit test was then performed on the total radiance data from the 
C-130 ROI. The x 2 test was described earlier in this chapter and was used here for a 
visual representation of how well each statistical distribution fits the C-130 data and to 
determine how many degrees of freedom to use for the statistical analysis later in this 
chapter. An important part of using the chi-squared test is to choose the right number of 
degrees of freedom. Too many degrees of freedom may allow poorly fitting distributions 
to appear acceptable, while too few may have the opposite effect. The objective of the 
next figure is to demonstrate that the performance of the different distributions is stable 



across the number of degrees of freedom. In Figure 4.2, curves of equal probability are 
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shown for the chi-squared distribution. These curves give the chi-squared for which 
100*a% of random sequences generated by the distribution would have a worse fit than 
the actual data. As can be seen in the figure, the lognormal is still not ideal, as a « 0.05. 
It should be noted that the chi-squared test heavily penalizes spurious observations at the 
tails of the distribution. While the lognormal distribution has a significance of only 5%, 
it is clearly a much better fit than the other distributions considered for this ROI. The 
computed statistical significance of the normal distribution is exactly zero (to floating 
point precision). Alpha is approximately constant, so 5 degrees of freedom was chosen 
for the rest of this study. 



Chi-squared Goodness of fit for C-130 data 




Figure 4.2. Chi-squared values for the distributions of the C-130 ROI. 
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Table 4.1 shows the parameter values of the best distribution fit for the C-130 
ROI. The statistical significance (a) is fairly low for the lognormal, but still better than 
both the gamma and normal distributions. 



Distribution 


P 


a 2 


x 2 


a 


Lognormal 


13.49 


.1095 


11.03 


.05 


Gamma 


- 


- 


26.29 


0 


Normal 


7.7x1 0 5 


8.67xl0 10 


113.41 


0 



Table 4.1. Parameters for distribution fits for C-130 ROI. 



Total Radiance Histogram for P-3 ROI with Distribution Fitting 




Figure 4.3. Total radiance histogram for P-3 ROI with distribution fitting. 



Figure 4.3 shows the P-3 ROI and the different distribution models. As discussed 
in Chapter HI, the histogram of the P-3 ROI is bi-modal, which makes it nearly 
impossible to model adequately. As is evident in Figure 4.3, none of the distributions 
provide a proper fit for this ROI, as should be expected due to their uni-modal 
characteristics. For such problematic data, more advanced methods such as non- 
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parametric statistics may have to be employed (Devore, 1988). Table 4.2 shows the 
parameters for the lognormal, which was the best fit, although statistically insignificant, 
and the gamma and normal distributions. 



Distribution 


P 


a 2 


x 2 


a 


Lognormal 


13.45 


.0200 


221.4 


0 


Gamma 


- 


- 


245.46 


0 


Normal 


7.0x1 0 5 


l.lxlO 10 


370.70 


0 



Table 4.2. Parameters for distribution fits for P-3 ROI. 



In Chapter HI, the P-3 ROI was broken into two parts, the fuselage and the wings. 
It was shown that the main contributor to the bi-modal nature of the P-3 ROI total 
radiance histogram was anomalies in the fuselage data. Figure 4.4 displays the total 
radiance histogram of the aircraft wings from the P-3 ROI with respective distribution 
fits. 



Total Radiance Histogram for Wings of P-3 ROI with Distribution Fits 




Figure 4.4. Total Radiance Histogram for Wings in P-3 ROI with Dist. Fits. 
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The model statistical distributions fit much better with the wing data, as they 
should. The lognormal distribution fit the P-3 wing histogram the best. Since the bi- 
modal characteristics of the P3 ROI prohibited accurate modeling with statistical 
distributions, finther analysis of this ROI will be concentrated on the wing data. Table 
4.3 lists the parameters for the distributions. 



Distribution 


P 


a 2 


x 1 


a 


Lognormal 


13.46 


.0219 


106.04 


0 


Gamma 


- 


- 


114.80 


0 


Normal 


7.1x10 s 


1.14xl0 10 


159.32 


0 



Table 4.3. Parameters for distribution fits for P-3 Wing ROI. 



Figure 4.5 shows the Fence line ROI data distribution with the respective 
statistical models. The histogram is slightly asymmetric with a left tail. All three models 



Total Radiance Histogram for Fence Line ROI with Distribution Frtting 




Figure 4.5. Total Radiance Histogram for Fence Line ROI with Distribution Fitting 
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Counts 



are nearly identical in shape in this figure, and capture some of the histogram’s shape. Of 
the three models, the normal was the best fit for this histogram. Table 4.4 shows the 
parameters for the distribution fits. 



Distribution 


4 


a 2 


T 2 


a 


Normal 


6.4x1 0 5 


2.3xl0 10 


7.67 


.10 


Lognormal 


13.34 


.0581 


23.25 


.0002 


Gamma 


- 


- 


16.81 


.0054 



Table 4.4. Parameters for distribution fits for Fence Line ROI. 



Total Radiance Histogram for First Background ROI with Dist. Fits 




Figure 4.6. Total Radiance Histogram for First Background ROI with Dist. Fits 



Figure 4.6 displays the first background ROI and the fitted statistical distributions. 
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The histogram is slightly skewed to the right, and all three statistical models appeared to 
fit somewhat well. The parameter values are shown in Table 4.5. The lognormal had the 
lowest chi-squared value of the three distributions, but was statistically insignificant. 



Distribution 


P __ 


a 2 


. x 2 


a 


Lognormal 


13.43 


.0260 


138.3 


0 


Gamma 


- 


- 


866.8 


0 


Normal 


6.9x10* 


9.2x1 0 10 


569.5 


0 



Table 4.5. Parameters for distribution fits of First Background ROI. 



Total Radiance Histogram for Second Background ROI with Dist. Fits 




Figure 4.7. Total Radiance Histogram for Second Background ROI with Dist. Fits. 



Figure 4.7 shows the second background ROI histogram with fitted distributions. 

None of the distributions fit this histogram due to its right skewed nature and large peak. 

The reason for this is that the above histogram has a left tail and you need a distribution 

with a left tail to model it. The gamma and lognormal plots have right tails while the 

normal is symmetric. The Weibull was included because of its ability to model such 

shapes, but as discussed earlier, there were implementation difficulties. Table 4.6 shows 
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the parametric data from the lognormal, which, while it was the best of the three 
distributions, was still statistically insignificant. 



Distribution 


F 


a 2 


x 2 


a 


Lognormal 


13.44 


.1164 


2262 


0 


Gamma 


- 


- 


2992 


0 


Normal 


7.3x10 s 


7.5xl0 10 


3746 


0 



Table 4.6. Parameters for Best Distribution fit. 

The next step in the analysis of the total radiance data from the various ROIs is 
the use of quantile-quantile (qq) plots. Plotting two rank-ordered vectors corresponding 
to random observations against each other composes a qq plot. The (x, y) position of the 
first point on the qq plot is deter min ed by the minim um values of the two random 



vectors, the last point is determined by their maximum values, and so on in between. 
Two different random vectors that come from the same underlying statistical distribution 



result in an approximately linear qq plot. Figure 4.8 presents qq plots for the C-130 ROI. 
The ROI data is plotted as X-quantiles and the random variables drawn from the various 



distributions are plotted as Y-quantiles. 
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Figure 4.8. QQ Plots for C-130 ROI. 
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The qq plot helps to determine where the problem lies with a particular 
distribution. For example, in Figure 4.8 none of the distributions predict enough high- 
end observations since the plotted points fall below the linear dashed line at the high end, 
and only the gamma, and to a lesser extent the lognormal have appropriate lower tails. 
The normal and Weibull plots from Figure 4.8 have too many low-end observations since 
their plots are below the linear dashed line. Basically, for the lower end of the qq plot, if 
observations are plotted below the line (Weibull plot in Figure 4.9), then there are too 
many low-end observations and if observations are plotted above (Lognormal plot in 
Figure 4. 1 0), then there are not enough low-end observations. For the upper end of the 
qq plot, if observations are plotted below the linear dashed line (Weibull plot in Figure 
4.9), then that means that there is not enough high-end observations and if observations 
are plotted above (Gamma plot in Figure 4.10), then there are too many high-end 
observations. 



x io 5 lognormal qq plot x 10 6 gamma qq plot 







Figure 4.9. QQ plots for Wings of P-3 ROI. 
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The qq plots presented in Figure 4.9 are for wings of the P-3 ROI. The gamma qq 
plot seems to be the most linear in nature, although the normal qq plot was fairly linear as 
well. This means that the data is likely to be described by either a gamma or a normal 
distribution. 

Figure 4. 10 shows the qq plots for the fence line ROI. The lognormal and normal 
plots did not predict enough low-end observations, while the Weibull predicted too many 
low-end observations. The gamma plot predicted too many high-end observations. The 
normal distribution appears to model the data best, however, some of its lower tail 
predictions need to be more linear to adequate model this data. 

x -jo 5 lognormal qq plot x -jq 5 gamma qq plot 
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Figure 4.10. QQ plots for Fence Line ROI. 



56 



Figure 4.1 1 displays the qq plots for the first background ROI. The lognormal 
appears to model this background data the best. The lower and upper tails of the 
lognormal plot need to be slightly more linear for this distribution to be an accurate 
model. For the most part, all four distributions were fairly linear with only slight 
deviations. 
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Figure 4.1 1. QQ plots for First Background ROI. 



The last qq plot displayed here is the plot of the second background ROI, which is 
predominately composed of sand and dirt. Figure 4.1 1 shows the four qq plots for this 
data. None of the four distributions does an adequate job of modeling this ROI data The 
lower tails of these distributions do not predict the lower-end observations at all, probably 
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due to the highly skewed nature of this background data. The appearance of the qq plots 
for the second background ROI is significantly different than the qq plots of the first 
background ROI. 
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Figure 4. 1 1 . QQ plots for Second Background ROI 



As a comparison to the October over-flight data, a detailed statistical analysis was 
also done on the regions of interest from the June over-flight data. The first total 
radiance histogram with distribution fits is of the C-130 ROI data and is shown in Figure 
4. 12. The results of the statistical distribution fits are the same as the fits done on the C- 
130 data from the October over-flight. The lognormal distribution fit the total radiance 
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Counts 



histogram the best of the statistical distributions tested here and the parameters are shown 
in Table 4.7. 



Distribution 


P 


<T 2 


x 2 


a 


Lognormal 


13.14 


.0843 


8.76 


.31 


Gamma 


- 


- 


17.47 


.006 


Normal 


5.3xl0 5 


2.5xl0 10 


71.66 


0 



Table 4.7. Parameters for distribution fits of June C-130 ROI. 



Total Radiance Histogram from June of C-130 ROI 




Figure 4.12. Total radiance histogram of June C-130 data with Dist. Fits. 



Figure 4. 13 displays the total radiance histogram of the P-3 Wing ROI from the 
June over-flight. Again, the results are very similar to the P-3 Wing data from the 
October over-flight which was shown in Figure 4.4. The lognormal distribution fit the 
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